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q _ We consider two-user multiple access games in which one player (primary user) is interested in maximizing its 

1 confidential data rate at the minimum possible transmission power and the other player (secondary, cognitive user) 

\£) ' employs eavesdropping as a leverage to maximize its data rate to a common destination at minimum transmission 

energy. For the non-cooperative static game, Nash equilibria in pure and mixed strategies are derived and shown to 
be Pareto inefficient for certain ranges of channel gains and energy cost parameters, when channel gains are common 
knowledge. A Stackelberg game formulation is then considered where the primary user is the leader. Interestingly, 
it is shown that the secondary user is forced to play as the follower where the Stackelberg equilibrium dominates 
_ the Nash equilibrium, even if the eavesdropper channel is better than the primary channel. Here, the Stackelberg 

utility achieved Pareto dominates the achieved Nash utility. Finally we study the unknown eavesdropper channel case 
numerically where the primary user has only statistical knowledge about the channel gain. We compare the results 
to the first scenario and show that it is not always beneficial for the cognitive user to hide the actual eavesdropper 



lO ■ channel gain. 

o 
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I. Introduction 

X: 

Research in Cognitive Radio Networks (CRNs) is motivated by wireless spectrum scarcity with respect to the 
ever growing demand of spectrum resources (T). Recent studies show that the spectrum is inefficiently utilized by 
current licensees 12. Cognitive Radio has been proposed as means of solving this problem. 

Game theory has been extensively employed in the analysis of Wireless Networks problems in general and CRNs 
in particular 0, 0. Specifically, the Multiple Access Channel (MAC) is one of the basic channel models that have 
been well studied using game theoretical techniques. For example, in 0, a multiple access game is considered 
where a coordination signal is used to determine the order of successive interference cancellation at the destination. 
Design of resource allocation algorithms for fading multiple access channels is studied in |6j] using Nash and 
Stackelberg equilibrium concepts. In (7), cooperative random access and cooperative token ring are studied using 
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coalitional game theory. A comprehensive survey on game theoretic approaches for interference free multiple access 
in wireless networks, where different users transmissions are orthogonal, is presented in (8). 

Due to its broadcast nature, a large challenge in wireless networks is security. Thus issue is even more stressed 
in cognitive radio settings. Adversarial activity is usually adopted by users to cause damage to the attacked system. 
As an example, malicious activity in a CRN was previously studied in where the malicious users are attackers 
aiming to degrade performance of the primary system through different methods like routing disruption and traffic 
injection. In IflOl , a Stackelberg game is studied in which transmissions of a trusted cognitive users can improve 
secrecy of primary users (PUs) with respect to an external eavesdropper. Other examples of security attacks in 
CRNs can be found in (9) as well. 

In this paper, we consider adversarial activity where cognitive users have a different goal. Specifically, secondary 
users (SUs) can employ their capability (e.g., eavesdropping) as a leverage to maximize their own performance: 
the attack may be used as a threat to the primary system. In our model, SuJ3 gain access to the spectrum through 
leveraging their eavesdropping capability. In a system where both PUs and SUs transmit to a common destination 
(e.g., base station or access point), SUs threaten the primary system by compromising the privacy of the primary 
traffic through eavesdropping. This threat possibly forces PUs to lower their transmission power levels to minimize 
the amount of eavesdropped packets. Consequently, SUs can achieve higher rates since interference from primary to 
secondary transmission will be lowered, as well. We employ physical layer (information theoretic) security measures 
ifTTl and use game theoretical analysis to study this adversarial situation. We characterize equilibrium points and 
hence the optimal resource allocation for each of the primary and secondary systems. 

To the best of the authors knowledge, this work is the first to consider leveraging eavesdropping capabilities to 
access channel in a cognitive radio setting. In our model, the PU is interested in maximizing its secure (confidential) 
data rate while transmitting at the minimum possible power. The SU wishes to transmit best effort traffic to a common 
destination D while at the same time minimizing its energy consumption. To this end, SU threatens to eavesdrop 
the transmission of PU to increase its utility. In particular, in our model, SU is equipped with a half duplex wireless 
transceiver and it employs a time division scheme where the available time is divided between transmitting its own 
information and eavesdropping the information transmitted by PU, subject to a linear energy cost. The contributions 
of the paper are as follows. First, Nash Equilibrium (NE) is characterized for the static game in both pure and 
mixed strategies. For certain ranges of the channel coefficients and energy cost parameters, the equilibrium point 
is shown to be inefficient and reveals a lose-lose situation. A leader-follower game is then formulated in which PU 
is the leader who specifies its strategy and then the follower reacts so that its utility is maximized. In this case, 
Stackelberg Equilibrium (SE) is characterized and shown to dominate, in the sense of Simaan lfl2l . the NE and 
hence the follower is forced to comply with this strategy. In these games, we first assume that all channel gains 
are common knowledge to both players. Then, we analyze the more realistic scenario when PU has only statistical 
knowledge about the channel gain of the eavesdropper. Here, the cognitive user has the ability to either hide or 

'We use the terms 'user' and 'player'; as well as 'secondary' and 'cognitive' interchangeably throughout the paper. 
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reveal the actual value of the eavesdropper channel coefficient. We show that its not always beneficial for SU to 
hide this information from PU, especially when secondary channel gains are low. 

The rest of the paper is organized as follows. Section [TT] provides the required background from game theory 
and information theory. In Section [TTH we present our game setup and derive its NE for all values of channel 
conditions and energy cost parameters. In addition, we show via examples how the derived equilibrium points can 
be inefficient. Stackelberg formulation is then considered in Section [TV] and the SE is shown to Pareto dominate 
the NE for all ranges of channel coefficients when PU is the leader. The effect of unknown eavesdropper channel 
at PU is then considered in Section [V] Finally, the paper is concluded in Section [VI] 

II. Background 

In this section, we review results from information theory about the multiple access channel and the wiretap 
channel that we need in our game formulation. Moreover, we present certain definitions from non-cooperative game 
theory that are essential in our analysis. 

A. Multiple Access Channel and Wire-tap Channel 




x 



I 1 Ri 

C(aPi) 

Fig. 1. The capacity region of the Gaussian MAC. 

The two-user multiple access channel is a well known channel model in the network information theory ff3l . 
Let the channel capacity function be defined as C{x) = \ log(l + x), where logarithms are taken to the base 2. 
The capacity region of a channel defines the achievable rates so that receiver can decode the information reliably, 
i.e., with an arbitrarily small probability of decoding error. For a two-user Additive White Gaussian Noise (AWGN) 
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Multiple Access Channel (MAC), the capacity region is a pentagon given by 

Ri(Pi) < C(aPi), 
R2(P 2 ) < C(cP 2 ) 

Ri{Pi) + i? 2 (P 2 ) < C(aP 1 +bP 2 ) 1 (1) 

where Ri(-), -R 2 (-) are the achievable rates for transmitters 1 and 2, respectively, Pi, P 2 are the transmission power 
levels, a > 0, c > are the (constant) channel power gains and noise is assumed to have unit variance. The capacity 
region (Q]) is shown in Figure Q] for the channel model depicted in the upper part of Figure |2j where we assign 
user 1 to be PU and user 2 to be SU. The two corner points of the capacity region are achieved by successive 
interference cancellation where the order of decoding at the destination determines the corner point |fl3l . 

In this paper, we employ rate expressions on the boundary of the region ([T). Specifically, we assume that the 
destination operates at point x in Figure Q] i.e., it always decodes SU first in the interference cancellation decoder 
and hence gives priority to PU. Consequently, the achievable rates at the destination for PU and SU are given by 

R 1 (P 1 ) = C{aP 1 ) 

fl 2 (P 2 ) = C(-^-). (2) 
1 + aP\ 

We note that rate pairs on the boundary of the capacity region (f2]i can only be achieved when both transmitters 
coordinate the codebooks and rates used in the channel coding |fl3l . If no coordination is assumed, then interference 
of each signal on the other will affect achieved rates of both users. 

In the presence of an eavesdropper, the achievable secrecy rate of a transmitter is a the rate at which the entropy 
of the sender's message at the eavesdropper is arbitrarily close to the entropy of the message itself, given the 
received signals. In other words, the secure rate is the rate at which the message of sender is almost independent 
from the received signals at the eavesdropper. Achievability schemes (i.e., channel coding) are designed to maximize 
the confusion at the eavesdropper while maximizing the reliable rate at the legitimate receiver by exploiting the 
wireless channel characteristics such as noise and fading. For the Gaussian channel, the secrecy capacity is given 

by ma 

R s {P)=C{aP)-C{bP) (3) 

for a > b and R S (P) = otherwise, where a, b > are the channel gains of the legitimate receiver's channel and 
the eavesdropper channel, respectively. The wiretap channel is sketched in the lower part of Figure [2] 

In our game, we assume that SU employs a half duplex transceiver and can either transmit to D or eavesdrop 
the transmission of PU at any given time. Thus, the channel model during SU's transmission is a multiple access 
channel, while it is a wiretap channel during eavesdropping, as shown in Figure [2] Throughout the paper, we refer 
to the channel between PU and D as the primary channel, the channel between SU and D as the secondary channel, 
and the channel between PU and SU as the eavesdropper channel. 
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Fig. 2. Channel model. 

Our network model is appropriate for CRNs for multiple reasons. First, when SU is transmitting information to 
the common destination D, it is given a lower priority than PU. This is clear from the achievable rate (0: the 
transmission of PU causes interference to SU and hence SU's achievable rate decreases with increasing transmission 
power of PU. Through the threat of SU, PU may be forced to decrease its transmission power Pi and hence SU 
achieves higher data rate, as will be discussed in the next section. Finally, as discussed in Section IIV1 the analysis 
reveals that SU is forced to follow PU in a leader-follower game. 

B. Game Theory Basics 

Game theory provides an analytical framework to analyze situations of conflict between multiple decision makers 
that are rational, intelligent and selfish. These attributes accurately characterize wireless devices designed to 
optimize their own performance. Here, we borrow definitions from lfT31l and lfl2l that are needed for equilibrium 
analysis in the following sections. A strategic game is any Q of the form Q = (Af, (Si)i^j\f, (iii)i£Af)> where AT 
is the set of players in the game. Let the utility of a player be given by ttj(s,-, s_j) where Sj € <S, is the pure 
(deterministic) strategy (or action) of player i chosen from the set of available strategies 5, and s_j is the strategy 
profile of all other players except for player i chosen from Xj e j l /_u\Sj. In the following definitions, we focus on 
two-player games, i.e., Af = {1,2}. 
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Definition 1: An NE point is a strategy pair [s\, s 2 ) such that 

Ul(8*,S2) > Ui(si,S2),Vsi G Si 

U 2 (sl,S 2 ) >U 2 (sl,S 2 ),\/s 2 E S 2 . (4) 

This definition implies that at an NE, no user has incentive to unilaterally deviate to other operating points. Assume 
there exist two well defined unique mappings T\ : S 2 — > Si and T 2 : S\ — > S 2 such that for any fixed s 2 G S 2 , 
ui(Ti(s 2 ),s 2 ) > Ui(si,s 2 ),Vsi G Si and for any fixed s x G <Si, u 2 (si, ^(si)) > u 2 (si, s 2 ), Vs 2 G 5 2 , i.e., T- 
defines strategies that are best response to each strategy chosen by the other player. Let the set D{ = {(si,s 2 ) G 
Si x S 2 : si = Ti(sj)} for i = l,j = 2 and i = 2,j = 1 be called the rational reaction set of player i and let 
Di(sj) = {si G Si : (si,Sj) E Di}. Note that any pair in the set Di n D 2 is an NE according to Definition [T] 
Hence, a strategy profile S is an NE if and only if the strategy of every player in S is a best response to the other 
player's strategy. 

When mixed strategies are allowed, Definition Q] is written for expected utilities instead. Let /j : <Sj —¥ [0, 1] be 
a mixed strategy for player i which defines a probability density function over s, such that /j(Sj) > OVs, G <!>,. 
Here, we define supp(fi) as the set of actions for player i with positive probability in the mixed strategy /,-, that 
is, supp(fi) = {s G <Sj : fi(s) > 0}. At any Mixed Strategy Nash Equilibrium (MSNE), strictly dominated actions 
can not be assigned positive probability fl6l . Here, an action Sj G Si for player i is strictly dominated if there 
exists a mixed strategy /j such that E[u,(/j, Sj)] > Uj(si, Sj) Vsj G 5j, j 7^ i. The following proposition helps in 
the characterization of MSNE lfl6l . 

Proposition 1: A mixed strategy profile /* = (/*, f 2 ) is an MSNE if and only if for each player i there exists 
some Ci G 7Z such that 

E[ui(si,/ 3 -)] =Ci, Vsj G supp(fi), (5) 

E ["i(Sj,/j)] < c i; ^ SUpp(fi), i J^j. (6) 

That is, at an MSNE, each player chooses the support for its mixed strategy so that it contains only pure strategies 
that leads to best response to the other players strategies. One important result about MSNE is that for any continuous 
game, i.e., game with continuous utility functions, there exists at least one NE point in mixed strategies lfl6l . 

A generalization of the strategic form game is the Bayesian game fTTl where some players have private information 
about the game that other players do not have. A Bayesian game takes the form 

Qb = (A/ - , (Si) ieA f, (7^) 4e Ar, (Pt)teAT) («t)teAf) ( 7 ) 

where % is the set of types of the i th player that specifies the information player i only knows about the game and 
Pi is the probability function specifying what player i believes about the other players' types given its own type. 
Here, it is assumed that each player i knows the structure of Qb and its own type £j G %. A Bayesian Equilibrium 
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(BE) is an NE for Qt, such that each player maximizes its expected utility. Therefore, the Bayesian game assumes 
that users are risk neutral. 

The last type of game formulations we employ is Stackelberg games. In a Stackelberg game, we have a leader 
that makes a decision about its own strategy and followers who then choose their strategies accordingly. In the 
following definitions, we fix player 1 as the leader and player 2 as the follower. The leader chooses the strategy 
that maximizes its utility from the rational reaction set of the follower. 

Definition 2: A Stackelberg equilibrium strategy is a strategy pair (si,s 2 ) £ D 2 such that 

ui(s!,s 2 ) >u 1 (s 1 ,s 2 ), (s 1 ,s 2 )eD 2 . (8) 

In this paper, we sometimes use the shorthand Stackelberg Equilibrium (SE) to mean Stackelberg equilibrium 
strategy. We also use the shorthand SEP (respectively SES) to indicate an SE with the primary user (respectively 
secondary user) as the leader. 

The utility of the leader is a well defined quantity lfT31 and is given by 

u\ = sup inf u\{si,s 2 ). (9) 

steSi s 2 eD 2 {si) 

A Stackelberg equilibrium strategy for the leader may not exist in general 11511 . In this case, however, an e-SE can 
possibly exist in which the leader achieves utility e close to u\. 

Definition 3: Let e > be a given real number. Then, a strategy si c E Si is called an e-Stackelberg equilibrium 
strategy for the leader if 

inf ui(si e ,s 2 ) > ui — e. (10) 
s 2 eD 2 (s le ) 

One important property is that an e-SE always exists in a game Q if u\ is finite lfl"5l . 

From Definitions [T] |2] it can be seen that the utility achieved by a user in a Stackelberg game under its own 
leadership is always at least as good as the utility achieved under any NE for the same game 031 . This fact 
motivates the following definition. 

Definition 4: A Stackelberg equilibrium strategy (si,s 2 ) with player 1 as the leader is said to dominate an NE 

(*i.*s)tf 

u 2 (si,s 2 ) > u 2 (sl,s 2 ). (11) 

In case (fTTT i is true, both the leader and the follower would better choose to play the Stackelberg game under the 
leadership of player 1 . In Section [IV] this property will be vital to show that SU accepts to be a follower in a 
Stackelberg game that leads to more efficient performance for both players. 
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III. Cognitive Eavesdropper Nash Game 

We consider a two-player static non-cooperative game Q where the players are PU and SU. In this game, the 
strategy of PU is to select transmission power level s% = P\ € [0, Pi], while the strategy of SU is to choose a 
fraction S2 = a € [0, 1] by which it divides the total available time T into transmission time aT and eavesdropping 
time (1 — a)T. Without loss of generality, we assume T = 1. Note that PU is transmitting all the time while SU 
is either transmitting its own data or eavesdropping the primary traffic. 

PU is interested in maximizing its secure rate to the destination at the minimum power cost. The utility function 
of PU is given by 

ui(Pi,a) = C{aPx) - (1 - a)C(M\) - 7P1 (12) 

where 7 > is the unit power cost. The second term in ( fl2] i reflects the rate eavesdropped by the cognitive user. 
For example, when a = 1, SU is transmitting all the time and the eavesdropped rate is zero. Note that there is no 
interference term due to transmission of SU since we consider the optimal multiple access scheme in which the 
traffic of SU is decoded first and then subtracted from the received signal using interference cancellation. 

We assume that SU is bounded by a maximum power constraint P-x. SU uses P2 as its fixed transmission power 
level over the entire transmission period. In addition, we assume that SU is penalized for energy consumption using 
a linear cost function. The utility function of SU is thus given by 

Vai P l ,a) = a(c( J f^)-0) (13) 

where f3 > is the energy cost per unit transmission time. In this section and in Section |IV] we assume that 
a, b, c, 7, /3, P2, Pi are common knowledge. Hence, the games considered in these sections are non-cooperative 
static games with complete information. The goal of each user is to maximize its own utility by selecting the 
appropriate strategy given the knowledge of the other user's utility function. 

The following notation will be useful in our analysis of the game Q. We define P*(a) as PU's power level that 
maximizes u 1 (Pi,a) for a given a 6 [0, 1]. Hence, P*(0) and P*(l) are the primary power levels that maximize 
the functions ui(Pi,0) and u\{P\, 1), respectively. Also we define the threshold power level Q as 

1 f cP 2 \ 

where the slope of the function U2(Pi,a) is positive if Pi < Q. 

To ease our characterization of the NE of Q, the result is separated into two cases. In the first, the primary 
channel is assumed to be stronger than the eavesdropper channel, i.e., a > b. Here, we focus our analysis only 
on pure strategies. This is made possible without any loss of generality since the strategy sets are convex and the 
utility functions are concave in the corresponding variables |fT31l in this case, as will be given in Theorem Q] In the 
second case, the complementary case a < b is considered. In this case, mixed strategy Nash equilibria may exist 
and hence an extended argument is required. The result of this scenario is presented in Theorem [2] The following 
Lemma characterizes the structure of the function Mi(Pi,a). 
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Lemma 1: For a given a, P*(a) maximizes the function ui(Pi,a) with respect to Pi, where 

P*(a)= argmax ui(P\,a) (15) 

Pie{0,P'(a)} 

P (a) = argmax ui(Pi,a) = (16) 



P 1 G[P(a),P 1 ] 



and 



P & " 6(f^) 
ab(l — v 1 — a) 

X(a) = aa6- 7 (a + o) (18) 
F(a) = 4706(7 - a + 6(1 - a)) (19) 
Proof: Given some strategy a of SU, the best response for PU, i.e., P*(a) is given as 

P* (a) = arg max ui (Pi , a) . (20) 

Pie[o,Pi] 

Using the second derivative test, it can be shown that ui(Pi,a) is concave in Pi if Pi > P(a) and convex 
otherwise. For the concave region (i.e., Pi € [P(a), Pi]) and by setting the first derivative of Ui(-) with respect to 
Pi to zero and solving for Pi, we get the expression for P'(a). 

For the convex region, the maximum is on the boundary of the interval [0, P(ot)]. Then, from the continuity of 
ui(-), the expression for P*(a) follows and the proof is complete. ■ 

The following Theorem characterizes the unique NE of the game Q for the case a > b. 

Theorem 1: For the game Q, if a > b, then the unique NE is 

'(P'(0),0); ifQ<P'(0) 

( S 14) = UQ,a Q ); if P'(0) <Q< P'(l) (21) 

(P'(l),l); ifP'(l)<Q. 

where ctQ £ [0, 1] is the time fraction of SU that solves the equation P'(a) = Q and 7 = 7ln(4). 

Proof: We show that the intersection of the best response correspondences of PU and SU are exactly the points 
in the Theorem. Therefore, no user has incentive to deviate unilaterally from such points and the conditions of 
Definition Q] are satisfied at these given points. 

From Lemma [U note that when a > b, P(a) < Va G [0, 1] and hence Pi > P(a) for all Pi g [0, Pi] and 
P*(a) = P'(a). 

The utility of SU U2(Pi,a) is linear in a given Pi and the slope of U2(Pi,a) is non-positive for Pi > Q. 
Consider the case Q < P'(0) and assume NE is at a = 0. Then, the best response for the primary player is 
Pi = P'(0) > Q implying that (P'(0), 0) is an NE in this case. This proves the first case. 

Now assume a* = 1. Then, the best response for the primary is Pi = P'(l). When P'(l) < Q, the best response 
for SU when Pi = P'(l) is a = 1 and hence (P'(l), 1) is an NE in this case. 
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For the remaining case, i.e., P'(0) < Q < P'(0), we note that SU is indifferent to the choice of a when PU 
chooses P\ = Q since the slope of U2(Q, a) with respect to a is zero in this case. The intersection of the best 
response sets for the PU and SU is at a* = oiq. The solution chq to the equation P'(a) = Q is given by 



The parameter aq is in the interval [0, 1] if and only if P'(0) < Q < P'(l) implying that it is the only NE in this 
case. Finally, note that P'(a) is an increasing function in a. Therefore, the relation P'(0) < P'(l) is always valid 



The unique NE for the game Q suggests the following. SU's decision depends on the choice of power level of 
PU with respect to the threshold value Q. Only when Pi < Q, SU is able to achieve positive utility. If the threshold 
Q is high so that P*(l) < Q (the last case in the theorem), then SU transmits all the time and the second term in 
the primary utility function vanishes. When a = 1, we have 



An example of such a case is when the primary channel gain a is small and the secondary channel gain c is high 
for a given f3. This can be seen from ( TBI and ((23). In this case, interference from PU to SU at the destination 
is low and the secondary chooses to transmit for the entire available time. In addition, PU is not affected by the 
transmission of SU. 

When c is small and a is large, we might have Q < P*(l). In this case, SU achieves zero utility (the remaining 
two cases in the theorem). In fact, SU is forced to select a = in order to avoid negative utility in these cases. 

Now we present a similar result for the case a < b. We start by computing the best response correspondences 
for each user. It is straight forward to see that for SU, T 2 (Pi) = 1 if Pi < Q, T 2 {P\) = if Pi > Q and 
IMPl) = a, a € [0,1] if Pi = Q, similar to the first scenario. For PU, note that P'(a) is an increasing 
function of a. To find Ti(a), the following definition is needed. Let a be defined such that Ui(P'(a), a) = 
and ui(P'(a),a) > Ma > a. From ( TBi we have Ti(a) = if a e [0,a), Pi (a) = P'(a) if a £ (a, 1], and 
Ti(a) = {0, P'(&)}. Note that if a < then P'(a) = P*(a) which is similar to the case when a > b. If a > 1, 
then ui(-) < for all P\ and a, implying that the decision of PU is P\ = 0. 

The following theorem characterizes the Nash equilibria when a < b for all cases of channel conditions and 
energy cost parameters. In the following, pi(x) denotes the probability PU uses the (discrete) action x G [0, Pi] in 
some mixed strategy. 



olq = 



7 [Q(a + b + abQ) + 1] - a + b 
b(aQ + l) 



(22) 



and we do not need to consider other cases. This concludes the proof. 




(23) 



11 



Theorem 2: For the game Q, if a < b, the following are the only Nash equilibrium points 



(0,0); 



if Q < 



(0,gi); 



if Q = 



(/2,02) ; 



if < Q < P' 



(a) 



(24) 



(Q,93); 



if P' 



(5) < Q < P'(i) 



if P'(l) < Q. 



where gi is the mixed strategy for SU with an arbitrary probability distribution over supp(gi) = [0,a], f 2 is the 
mixed strategy for PU with supp{f 2 ) = {0, P'{a)} and pi(0), pi(P'(a)) are the unique solution to the linear 
equations 



^2, 53 are the mixed strategy for SU such that supp(g 2 ) = supp{g2) = [0, 1] and E[a] = a, E[a] = aq , respectively. 

Proof: In the first case, v,2(Pi,ot) — Va > 0. Then the best response of the secondary user is a* = VPi. 
Since a < b, then ui(Pi,0) < VPi > implying that P* = 0. This proves the first case. 

For the second case, the intersection of the best response correspondences is (0,a),a £ [0,5], and hence we 
have an infinite number of pure strategy Nash equilibria. For any MSNE (/',<?')> we have expected value of the 
slope of u 2 {-) is zero implying that supp(f') = {0}. Hence, g' can have any distribution on [0,a]. Note that all 
equilibria in this case have the property that E[iii(-)] = E[it 2 (-)] = 0. 

For the third case, the intersection of the best response sets is empty and there is no pure strategy Nash equilibrium. 
To find the support of the mixed strategy of PU, note that Pi G {(0, P'(a))}\J{P\ > P'(l)} is never a best response 
to any strategy of SU and hence it can be discarded from the support. Using properties of MSNE from Section Hl-BI 
and the fact that there exists at least one mixed strategy Nash equilibrium, then for some a' £ [0,1], there exists 
some mixed primary strategy /' such that Ui(Pi, a') = K VPi € supp(f'), where K is some constant. Observing 
the best response set of PU, it can be seen that a' = a is the only SU strategy satisfying this condition if support 
of SU is singleton. In addition, supp(f') = {0, P'(a)} and K = 0. To find /', SU must has no incentive to deviate 
from a = a, i.e., E[tt2(/', a)] = 0. It can also be seen that for any distribution g' of a on [0, 1] such that E[a] = a, 
(f',g') is an MSNE. This imply the third case in Theorem. 

For the fourth case, the intersection of Xi(-) and T^-) implies the unique pure NE point (Q,ctQ). It can also 
be seen that if SU randomizes its action over the interval [0, 1] such that E[a] = uq, then best response of PU is 
Pi = Q. Any other (mixed) strategy for SU will not result in a NE. 

Finally, when Q > P'(l), then a* — 1 for all Pi in the rational reaction set of the PU. In addition, when 
Pi = P'(l), no player has incentive to deviate. This concludes the proof. ■ 

We now discuss interesting properties of the Nash equilibria of Q that motivates our investigation of the 



pi(0) > 0, pi(P'(a)) > 0, pi(0) +pi(P'(5)) = 1 




(25) 
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Stackelberg game and lead to the main result of the paper. While the following discussion holds for both considered 
scenarios of channel conditions, we only focus on the scenario a > b for brevity. 

We observe that, in the cases where Q < P'(l), the NE point is inefficient: there may be other operating points 
where at least one player achieves higher utility while the utility of the others is not decreased. For example, assume 
< Q. In the first case where Q < P'(0), if PU chooses a power level less than but arbitrarily close to Q, then 
SU is willing to transmit all time, i.e., chooses a = 1. In this case, SU achieves a strictly positive utility rather 
than zero utility achieved at the NE. In addition, since the eavesdropping term in Ui(-) vanishes in this case, PU 
can achieve a better utility if U\(Q, 1) > u 1 (P'(0),0). The same argument is valid for the second case where 
P'(0) <Q< P'(l). 

To elaborate on our observation, consider the following numerical example. Let a = 2.5,6 = l,c = 3.5, P2 = 
Pi = 1, /3 = 1,7 = 1. According to (fl4] | and (fT6l l, these values imply that Q < P'(0). In this case, the utility of PU 
at the NE is ui(P'(0), 0) = 0.0211 while its utility at Pi = Q and a = 1 is Ui(Q, 1) = 0.0631 which is three times 
better. When c is changed to c = 5 while keeping the rest of parameters the same, we have P'(0) < Q < P'(l). 
Here, oiq = 0.3667, ui(Q,cxq) = 0.0681 and Ui(Q, 1) = 0.1761, which is more than two times better than the 
primary utility achieved at the NE point. The utility of PU is sketched in Figure [3] for the cases a = 0, a = oiq 
and a = 1 when c = 5. The points maximizing each case are marked in addition to the the suggested operating 
point u\(Q,l). The aforementioned operating points are not equilibrium points of the non-cooperative strategic 
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Fig. 3. Primary utility for the case P'(0) < Q < Inefficient NE marked on the middle curve and suggested operating point circled on 

the upper curve. 
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game. For instance, in the case Q < P'(0), if SU chooses a = 1, PU can take advantage of this choice and 
select Pi = -P'(l) which will cause SU to achieve negative utility. Therefore, without communication and contracts 
between players, there is no guarantee that both players will play the strategy profile (Q — e, 1) for an arbitrarily 
small e > 0. However, if both players agree to play the game with some order and not take decisions simultaneously, 
better equilibrium points can be reached. In the following section, we formulate a leader-follower game in which 
the inefficient NE points of Q are alleviated. 



In this section, we show the existence of a Stackelberg strategy under the leadership of PU that results in better 
payoff values for both players in Q compared to the payoffs achieved at NE. In a Nash game, each player chooses 
its strategy independently of the actual choice of other players. This property of a strategic game can be viewed as 
if players decide their choices simultaneously. It can also be viewed as a sequential decision process where each 
player has no information about the decisions of other players. 

On the other hand, in a Stackelberg game, the leader of the game chooses its strategy first and then announces 
it to other players in the game (followers). Then the followers react to the strategy of the leader to maximize their 
own utilities. This leader-follower scenario can model the situation when a player has the power to enforce other 
players to be followers. In addition, a rational player is willing to play the Stackelberg game as a follower if this 
implies a better utility than that achieved at the NE of the game. If all players in the game achieve higher utility in 
the Stackelberg game with the leadership of some player than that achieved at the NE of the game, then this SE is 
said to dominate the NE according to Definition |4] For our game Q, we have two possible Stackelberg games: in 
the first, PU is the leader and in the second, SU is the leader. Here, we show that an SE with PU as the leader (i.e., 
SEP) dominates the NE in Section [HI] Moreover, we show that any SE with SU as the leader (i.e., SES) can not 
dominate the NE of Q. This implies that SU is willing to be a follower in a Stackelberg game in order to achieve 
better utility values. 

To check the existence of an SE for our game, we start by computing the rational reaction set for SU, D-2- It 
can be seen that 



The following two lemmas establish the existence of an SEP and prove its dominance with respect to the NE for 
the game Q, for the two different cases of channel conditions considered in Section [ill] 

Lemma 2: For the game Q with a > b and for any given e > 0, if the channel gains a, b, c and cost parameters 
7, (3 and Pi, P2 are finite, then there exists an e-SEP. Moreover, if e is sufficiently small, then the e-SEP dominates 
the NE of Q. 

Proof: For the existence part, it suffices to show finiteness of Si as in [15]. From Definition [3] and from ( l26t . 



IV. Cognitive Eavesdropper Stackelberg game 



D 2 = {(P lf 0) : Pi > Q}U{(Pi, 1) : Pi < Q} 




(26) 
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(27) 



the utility of the PU for a Stackelberg game Q under its leadership can be calculated as 

ui(i*(0),0); ifQ<0 
max{ui(Q,l),Ui(P'(0),0)}; if < Q < P'(0) 
ui(Q,l); if P'(0) <Q< P'(l) 

u^P'il),!); ifP'(l)<Q. 
If a, 6, c and 7, /3, p, P2 are finite, then Q and P'{a) are finite for all a <G [0, 1]. Then, u\ is finite in all cases and 
existence follows from Property 4.2 in lfT31 . Now, fix some e > and consider the following strategy for PU. 



p'(oy, 



if Q < 



argmax ui(Pi,£> 2 (Pi)); if < Q < P'(0) 

Pie{Q-e,P'(0)} 



(28) 



Q-e; if P'(0) <Q < P'(l) 

P'(l); ifP'(l)<Q. 

For the first and last cases in (1281 1. it can be seen that Ui(s\ e , D2(s\ e )) = U\. In addition, for the other two cases, 
since iti(-) is uniformly continuous in P l5 it can be seen that we have U\{s\ e , D2(s\ t )) > u\ — e which holds for 
all cases of d281 l. This implies that s\ t is in fact an e-SEP by definition. Finally, to show dominance of the above 
SEP over NE of Q, note that U\ > u^sf, s^)- Then, for e sufficiently small, ui(s\ e , D2(s\ e )) is sufficiently close 
to ui. For the utility of SU, it can be seen that U2(-) is the same as in the NE for the first and last cases of (f28j) 
and U2{s\ el D2{s\ t )) > U2{s\,S2) concluding the proof. ■ 
Lemma 3: For the game Q with a < b and for any e > 0, the following strategy for the PU is an e-SEP. 

0; if Q < 

Q-e; ifO<Q<P'(l) (29) 
P'(l); ifP'(l)<Q. 

Moreover, s\ t dominates the NE of Q for small enough e. 

Proof: Follows a similar argument to the proof of Lemma [2] and is omitted for brevity. ■ 

When the game starts and according to the channel conditions and the cost parameters, PU and SU choose their 
strategies. By Definition HJ since the SEP in Lemma [2] and Lemma [3] dominates NE of Q, then it can be seen that 
SU will prefer to be the follower in a Stackelberg game under leadership of the primary than to play Nash. At the 
NE of Q, SU achieves zero utility for Q < P'(l). However, at the SEP, SU achieves a strictly positive utility value 
in the third case in d28b and non-negative utility (according to channel conditions) in the second case. 

Nevertheless, as stated in Section HU the utility of a player in a Stackelberg game under its leadership is at least 
as good as its utility in a Nash game. Consequently, SU may prefer to play a Stackelberg game under its own 
leadership and not to follow PU. The following Lemma, however, shows that for the game Q, no SES dominates 
the NE. This result shows that PU can in fact enforce SU to be a follower in a Stackelberg game. 
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Lemma 4: For the game Q, there exists no SE under the leadership of SU that dominates the NE. 

Proof: Consider the scenario a > b. We start be computing the rational reaction set D\. It can be easily seen 

that 

D\ ={(Pi,a) : P x = P'(a),a £ [0,1]}. (30) 

Now we check the SES and compare it to the NE point of Q. The SES is given by 

a = argmax U2(s\,S2) 
( Sl ,s 2 )er>i 

C P 2 

= argmaxa (C(— — — -—) - /3) (31) 
ae[o,i] l + aP'(a) 

We start by comparing to the last case in ( f2TT >. Suppose the maximizer of OTb is s 2 = a — 1. Then, si = P'(l) and 
we have Mi(sJ,s5) = iti(si,S2). If a < 1 and since ui(-) can only decrease by increasing a, then iti(s:[,s|) > 
wi(si,S2) and the SES is not dominant in this case. For the first case in (l2Tl i. it is easy to see that ui(s*,S2) = 
Ui(si, S2). Finally, for the middle case, it can be seen that a < oq implying that Ui(s*, s%) > U\(s\, S2). 

Now consider the scenario a < b. Here, D\ = {(0, a) : a £ [0, a]} U {(P*(a),a) : a £ [a, 1]}. It can be seen 
that 112(01, Di{a)) is linear and increasing for a < a and is multivalued at a = a. In addition, it is monotonically 
decreasing for a > a, with a discontinuity at a = a such that v,2(&~ , D\(a~)) > U2(a + , D\(a + )). Hence, there 
exists an e-SES at S2 = a. = a — e implying that Ui(s"i, S2) = < ui(s\, s|) at such equilibrium. This concludes 
the proof. ■ 

As given in Lemma |H at any SES of Q and comparing to the first two cases in (TZTT i. SU can choose a that leads 
to a larger secondary utility. However, this choice can only degrade the primary utility ui(-) and hence any SES 
does not dominate NE of Q according to Definition [4] The results of this section are summarized in the following 
theorem where the proof follows from Lemmas [2] [3] and [4] and the fact that PU can threaten SU to play Nash. 

Theorem 3: For the game Q, SU accepts to play as a follower and the outcome of the game is the SEP point in 
Lemma |2] and Lemma [3] 

Remark 1: Since the main result of the paper in Theorem [3] follows from the structure of the utility functions, 
i.e., linear cost function and time division scheme for SU, similar results can be derived for other threat games. For 
example, a threat game is recently considered in jTSl where SU is a jammer user that divides available transmission 
time between transmitting own information and transmitting noise symbols. 

Given that both players are rational and that both consider Nash and Stackelberg games, it is clear from Theorem 
[3] that both players will choose to play the Stackelberg game with PU as leader and SU as follower in all cases of 
channel conditions and energy cost parameters. 

It can be seen from ( fT3l that when the secondary channel is weak, i.e., c is low, the SU achieves negative utility 
if it is transmitting all the time. By threatening the PU via eavesdropping, the SU forces PU to play a Stackelberg 
game, which enables the SU to achieve a strictly positive utility. However, it is interesting to note that since the PU 
is the leader in this game, it specifies how much transmission is allowed to the SU by choosing e > 0. No matter 
how small e is chosen, SU is forced to comply with this specification. When the secondary channel is strong, i.e., 
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c is large, interference on SU from PU is negligible, and therefore SU transmits all the time. In addition, the PU 
achieves the largest possible utility in this case. 

The following is an example practical scenario for the game we consider: an SU joins an existing primary system 
and measures the received signals from PU to estimate the eavesdropper channel gain b. Assume that maximum 
power levels and energy cost are common knowledge in the game. Since D estimates the primary channel gain a, 
SU can also decode the feedback signals from D to PU to know a. Then, SU announces its presence and therefore 
D can measure the received signals from SU, estimate c and then feedback this value to both PU and SU. Here, we 
focus on the general case of asymmetrical channels where PU can not estimate b from the received signals from 
SU on the reverse channel. To this end, SU has information about a, b and c while PU has information about a 
and c only. PU can play the Stackelberg game only when it has information about b, in which case it announces 
its strategy, i.e., the primary power level and the value of e. However, SU may not want to share this private 
information with PU in order to improve its own utility. This motivates our analysis in the next section. 

V. Hidden Eavesdropper Channel 

In this section, we relax the assumption of the knowledge of the eavesdropper channel gain b at the PU that was 
used in the analysis in previous sections [HI] and [TV] Specifically, we assume that SU can choose to hide the actual 
eavesdropper channel gain b and PU has only statistical information about b. Hence we have a non-cooperative 
static game with incomplete information or a Bayesian game flT) . This model is more practical in the sense that 
SU would not be willing to reveal this private information in general. 

As introduced in Section [II] the objective of the players in the Bayesian game is to maximize the expected 
value of their utility payoff functions. Here, PU has a single type while the type of SU is the actual value of 
the eavesdropper channel gain b. The BE depends on the probability distributions that represent the belief of the 
players of the unknown parameters. In this section, we assume that PU believes that the eavesdropper channel 
gain is a realization of a Rayleigh channel. Hence, b is exponentially distributed with mean 6, i.e., = b' with 

-b' 

Pb{b') = e r ■ We assume that the average value b is known to PU. 

The objective of PU in this case is to select a power level to maximize the expected utility payoff function with 
respect to b, which is given by 

E 6 [ui(Pi,a)] 

p OO 1 

= / (C(aPi) - (1 - a)C(6Pi) - jPi) = e^db 
Jo b 

/I \ pOO k 

=C(aPi) - 1 - ' / C(6Pi) e^rdb - 1 P 1 
b Jo 

= c ^-^^r { ^y lPl (32) 

where T(s,a;) is the upper incomplete Gamma function and the last equality follows by |fl9l for Pi ^ 0. It is 
known that for real and positive x, we have 

T(0,x) = -Ei(-x) = Ei(x) (33) 
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where Ei(ai) = ^-dt is the exponential integral function and E\{x) = f°° ^j-dt. The eavesdropping term in 
d32i i is concave and increasing in P\ which has similar structure to the eavesdropping term in the case of known 
eavesdropper channel in previous sections. Hence, it can be seen that Ef,[ui(-)] and the equilibrium of the game 
in this section will have the same structure as their corresponding parts in Sections [nil HVI However, it is hard to 
maximize d32b analytically. Consequently, in the rest of this section, we perform a numerical study to compute and 
compare equilibria of the new game to the case when the eavesdropper channel is known at the PU. 

Let the optimal response of the PU in the hidden b case be Pb{a), i.e., Ph(a) maximizes \u\ (Pi , a)] for 
some given a and b. Note that the function Pb(a) is fixed for a given b while the function P*(a) depends on the 
actual realization of b. In Figure [4] we compare P,(a) to P*{a) for a realization of the eavesdropper channel gain 
b = 0.7 where we also set a = 3, b = 0.7, /3 = 1, 7 = 1, Pi = P\ = 10. It is clear that both curves meet at a = 1 
since the eavesdropping term in ui(-) vanishes at this point where E{,[mi(Pi, 1)] = ui(Pi,l). For a < 1 and a 
given eavesdropper channel realization b, PU uses a higher power level if it only knows b than that used when the 
actual realization of b is known, i.e., Pb(a) > P*(ct) in this case. In general, Pb(a) 7^ P*(a) and hence the iti(-) 
decreases in the hidden b scenario with respect to the known b scenario. 




Fig. 4. Example primary rational reaction curves for the channel realization b = 0.7 and a = 3, b = 0.7, = 1, 7 = 1, Pi = P2 = 10 



Next, we compare the utility achieved by PU and SU in the two given scenarios for different values of the mean 
b and the secondary channel gain c. At the SEP, PU specifies the amount of information allowed for SU, i.e., the 
value of e. In this section, we fix e = 10~ 2 . In Figure [5] the utility of PU is plotted and compared for both cases 
of knowledge of b at the PU, when c = 0.7 and c = 1.3. We compare average performance where the utility is 
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averaged over 10,000 realizations of b. In this comparison, we use Pi = P2 = 5, a = 3, and j3 = 7 = 1. It is clear 
that knowing the exact value of b implies utility that is never less than the other case, i.e., knowledge of b can not 
hurt PU. In addition, the utility in the hidden b scenario approaches that of the known b scenario for larger range 
of b when c is large. This result can be expected from Lemma |2] and Lemma [5J where the effect of eavesdropping 
(and hence b) vanishes when c is large enough so that Q > P*(l). 
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Fig. 5. Utility of PU for revealed and hidden b cases. 



Finally, we study the effect of hiding b on the utility of SU. When c < 0.6, Q < and SU chooses a = for all 
Pi. In this case, U2(-) = for either case of knowledge of b at the PU for all the considered range of b. In Figure 
[6] we plot W2(-) versus b using the same channel and cost parameters as in PU case. Here, it is clear that, for large 
values of b, not revealing the actual value of b improves the utility of SU by confusing PU and lowering down its 
utility. However, for small values of b, SU can in fact decrease its own utility by hiding the actual value of b. 

This last result can be attributed to the following reason. First, when b is small so that actual value of b is small 
with high probability, ui(-) and E[ui(-)] are concave in Pi. Then, the decision of PU is according to (l28l l if SU 
reveals b and similarly if b is hidden. In addition, when c is small so that Q < P'(0) and Q < Pb(0), we have a 
range of b values such that 

«i(P / (0),0)<«i(Q-c,l)<E[« 1 (n(0),0)], (34) 

where in this range, the optimal decision for PU is Pi = Q — e in case of revealed b and Pi = Pb(0) for hidden b 
case. Hence, SU's decision (as a follower) is a = 1 and a = in the cases of revealed and hidden b, respectively. 
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Fig. 6. Utility of SU for revealed and hidden b cases. 



Thus, SU is willing to reveal the actual value of b to achieve better utility in this case. 

VI. Conclusion 

In this paper, we analyzed a possible situation of conflict between a primary user and a cognitive user, each 
communicating with a common destination. Both transmitters are interested in maximizing their own data rate at 
the minimum possible energy. The cognitive half duplex user threatens the primary user to eavesdrop the primary 
traffic if it is not allowed to access the spectrum and transmit its own information. We studied the case when 
the energy cost is linear and we assume that the cognitive transmitter is given a lower priority in the successive 
interference cancellation decoder at the destination. Using tools from non-cooperative game theory as well as 
information theoretic transmission strategies, it is shown that the eavesdropping capability of the secondary user 
forces the primary user to lower down its power level so that the secondary user achieves a non-negative utility. 
However, by using Stackelberg formulation, we show that the primary user specifies the allowable secondary rate 
and the secondary user is forced to comply to this specification, even if its achieved positive utility is small. 
Moreover, the result also holds for the stronger case when the primary channel is weaker than the eavesdropper 
channel. Furthermore, we presented a numerical study for the case when the eavesdropper channel state is known 
only statistically at the primary user. By not revealing information about the eavesdropper channel gain, we show 
that the cognitive user can improve its achieved utility, unless the eavesdropper channel is weak. In our future 
investigations, we will study systems with multiple cognitive eavesdropping users. 
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